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Abstract 

A multi-player competitive Dynkin stopping game is constructed, extending the work of 
Guo and Rutkowski [6J. Each player can either exit the game for a fixed payoff, determined 
a priori, or stay and receive an adjusted payoff depending on the decision of other players. 
The single period case is shown to be "weakly unilaterally competitive" (Kats and Thisse [Io| ] 
and De Wolf 0]). We present an explicit construction of the unique value at which Nash and 
optimal equilibria are attained. Multiple period generalisations are explored. The game has 
interpretations in economic and financial contexts, for example, as a consumption model with 
bounded resources. It also serves as a starting point to the construction of multi-person financial 
game options. In particular, the concept of optimal equilibria becomes pivotal in the pricing of 
the game options via super-replication. 

Keywords: Dynkin game, stopping game, n-player game, optimal equilibrium, game option. 
AMS Subject Classification: 91A06, 91A10, 91A15, 9IA50, 60G40. 



1 Introduction 

A Dynkin stopping game, first introduced by Dynkin [3j, is a zero-sum, optimal stopping game 
between two players where each player can stop the game for a payoff observable at that time. Much 
research has been done on this as well as its related problems, e.g., 0, 0, HI, [H , 14, 16, 1?], [HI flij . 



One application of Dynkin games is in game contingent claims, or game option, as defined by Kifer 



. who proved the existence and uniqueness of its value. Further works, such as Hamadene and 
Zhang Q and Kallsen and Kuhn Q, studied various techniques in its pricing. 

Various formulations of multi-player Dynkin games exist in literature. For example, Solan and 



Vieille 17] introduced a quitting game which terminates when any player chooses to quit, then 
each player receives a payoff depending the set of players quitting the game. Under certain payoff 
conditions, a subgame perfect uniform e-equilibrium using cyclic strategies can be found. In Solan 
and Vieille fj~sj ] . another version is presented, in which the players are given the opportunity to stop 
the game in a turn-based fashion. A subgame perfect e-equilibrium was again shown to exist and 
consisted of pure strategies when the game is not degenerate. Hamadene and Hassani Q presented 
a non-zero sum stopping game where each player has his own separate payoff processes. These 
processes are independent of the other players' decisions, so in the event where a player does not 
stop first, his payoff does not depend the exact set of players who stopped. 

We aim to generalise the Dynkin stopping game to more than two players in a natural way 
which allows for the construction of a multi-person financial game option. Guo and Rutkowski 
Q introduced a zero-sum, simultaneous Dynkin game, with a focus on designing the dependencies 
between the payoffs of all players and their stopping decisions. In effect, it is modelling a multilateral 
"contract" where all the players are competing for a fixed total sum of wealth. Each player can either 
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exit or terminate the contract for a predetermined benefit, or do nothing and receive an adjusted 
benefit, reflecting the discrepancies caused by any exiting decisions. These adjustments ensure that 
the total wealth is fixed. 

This paper extends the results of Guo and Rutkowski [6] to games which are not necessarily 
zero-sum, but still retains the "weakly unilaterally competitive" (or WUC) property introduced by 
Kats and Thisse [10| and De Wolf [2| . As discussed Pruzhansky [15] , Nash equilibria are not always 
adequate as a solution concept. This certainly occurs in the valuation of the game options, as the 
Nash equilibria payoffs cannot be guaranteed. Instead, we formally introduce the stronger "optimal 
equilibria" , or Nash equilibria with maximin strategies, which induces a unique value for the game. 
The WUC property ensures that all Nash equilibria are also optimal equilibria. 

The main results of the paper are Theorems 13.71 and 14.01 which proves the existence of the value 
by explicit construction, and expresses it as the projection onto a simplex under an appropriate 
choice of inner product. The construction also produces a pure strategy optimal equilibrium. 

Several extensions are discussed. All single period results can be immediately applied to the 
stochastic case where both terminal and exercise payoffs are random, as long as expectations are 
incorporated into the definitions of solution and equilibria. Also, two multiple period generalisations 
are studied, including a recursive stopping game and a quitting game. The recursive stopping game 
can be readily applied to multi-person financial game options, where the properties of the optimal 
equilibrium become imperative in the pricing arguments. Details will be presented in an upcoming 
paper. The quitting game is a variant which cannot be stopped early, but each player can choose 
to quit at any time. Optimal equilibria are constructed for deterministic case and subgame perfect 
optimal equilibria are constructed if the game is perfect information. 

Apart from multi-person financial game options, the game presented here may be interpreted in 
other economic and financial contexts, for example, as a consumption model with bounded resources. 
It serves as a starting point to a particular class of competitive multi-player games. Many more 
multiple period and continuous time generalisations are possible, and they are under further research. 

Section [5] of the paper establishes some preliminary results in game theory and introduces the 
optimal equilibrium. Section [3] constructs the single period game and proves the existence and 
uniqueness of the value. Section [4] revisits the results of Guo and Rutkowski 0] and applies them 
to non-zero sum settings. The value is constructed using projection. Section [5] applies the results to 
the stochastic case, and then examines two multiple period possibilities. 

2 Optimal Equilibrium 

This section will discuss several game theory concepts, while keeping track of two main focuses: 
To find solution concepts applicable to the pricing of financial game options, as well as conditions 
characterising the idea of competitiveness in multi-person games. 

Consider a game Q with m players, enumerated by the indices 1,2, ... ,m. The set of all players 
is denoted by M.. Each player k can choose a strategy Sk G S k , and the m-tuples of strategies 
s = [si, . . . , s m ] s S are strategy profiles. Given a strategy profile s, it is possible to compute a 
vector of payoff functions V(s) = [Vi(s), . . . , V m (s)] for the players, with larger payoffs being more 
desirable. 

As introduced by Nash [ujj], a strategy profile s* G S is referred to as a Nash equilibrium, 
or simply an equilibrium, if no single player can improve his payoff by altering his own strategy. 
Formally, for each k 6 M, 

V k ( [4 , st k ] )>V k ( [s k , s*_ k ] ) , Vs k eS k . 

A Nash equilibrium represents a state which no player would deviate from. It gives some intuition 
to the value of the game, but in the context of game options, there are several deficiencies. The 
valuation of financial options involves replication of payoffs. But in general, this is not possible 
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without knowing the action of the other players. One cannot assume that the other players will 
converge towards Nash equilibria, so the equilibrium payoffs are not guaranteed. Furthermore a 
game may have several equilibria leading to different payoff values, and it's not always clear which 
one should be chosen. 

We formally introduce a new, stronger concept to address these issues. 

Definition 2.1 (Optimal Equilibrium). A strategy profile s* € S is called an optimal equilibrium 
if, for each k E M, 

V k ([«£,a- fc ]) > Vfc ([s* k ,s*_ k ]) > V k ([s k ,s*_ k ]) , Vs fc e S k , Vs_ fe G S_ fe . 

An optimal equilibrium is essentially a saddle point. It has the properties of a Nash equilibrium, 
with the addition that each player can guarantee a lower bound on his payoff without knowing the 
actions of other players. In other words, it re plic ates the properties of a Nash equilibrium with 
maximin strategies, as discussed in Pruzhansky [15| . This is crucial in the context of a game option 
as it allows for super-replication. Furthermore, as shown in Proposition 12.31 all optimal equilibria 
achieve the same value. 

Definition 2.2 (Minimax, Maximin, Value). 

• The maximin value of player k is the maximum payoff he can guarantee. 

V k = max min V k ( [s k , s_ fe ] ) 

• The minimax value of player k is the lowest payoff that the other players can force upon him. 

V k = min max V k ([s k ,S- k ]) 

s-t£S- t s fc GSfc 

• In general, V k > V_ k . If equality is achieved, then V k * = V k = V_ k is the value of the game for 
player k. 

In general, a game may not have a value. But the existence of an optimal equilibrium guarantees 
one. 

Proposition 2.3. Let s* be any optimal equilibrium. Since 

Vk ([s* k ,s*_ k ]) = max V k ([s k ,s*_ k ]) > V k 

«fc to], 

>V k > mm V k ([8t,8- k ])=V k ([ 8 %,8*_ k ]), 

all expressions are equal. Hence every optimal equilibrium attains the value of the game for all 
players 

V(s*) = V* = [V*,...,V£\. 

It is easily shown that any pure strategy Nash equilibrium is still a Nash equilibrium in the mixed 
strategy game. The same argument can be readily applied to show that any pure strategy optimal 
equilibrium is also an optimal equilibrium if mixed strategies are allowed. Hence the value of a pure 
strategy game is also the value of its mixed strategy extension. 

In two person zero-sum games, optimal equilibria are equivalent to Nash equilibria. In fact 
Proposition 12.31 simply reduces to John von Neumann's minimax theorem [20( . The theorem states 
that in any two person, zero-sum game with finite strategies, there exists a unique payoff where 
mixed strategy Nash equilibria are achieved. This unique payoff is also the value of the game. 

In multi-player zero-sum games, the payoff of any particular player is not sufficient to determine 
the other individual payoffs. Nash equilibria are not necessarily optimal equilibria, and they may 
not achieve the same payoff. However, a result similar to the minimax theorem exists if the zero-sum 
condition is strengthened to weakly unilaterally competitive, as described in Kats and Thisse (Iol | and 
DeWolf0. 
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Definition 2.4 (Weakly Unilaterally Competitive). A game is said to be weakly unilaterally com- 
petitive (or WUC) if for any k,l G M: 

V k ( [s k , s_ fe ] ) > V k ( [4, s_ fe ] ) =^ V t { [s k , s_ fc ] ) < V t ( [4, s_ fc ] ), 
V fc ( [ Sfe) ) = U fe ( [s' k , sj] ) =^ V t ( [s fe) S _ fe ] ) = Vi ( [4 , s_ fc ] ) 

for all Sfc, s' k G S fc and s_fc £ 

WUC explicitly quantifies the concept of competitiveness. If a player deviates from a strategy 
profile, any changes to his payoff is opposite in sign to the changes of other payoffs. Both Kats 
and Thisse (Iol | and De Wolf Q proved the following result, an analogue of the minimax theorem in 
multi-player, WUC settings. 

Proposition 2.5. In a WUC game, any Nash equilibrium is also an optimal equilibrium. 

WUC is not a necessary condition for the existence of optimal equilibria, but it eliminates 
the possibilities of Nash equilibria achieving multiple values. It is a desirable condition for the 
construction of our game. 

Finally, this paper will only focus on pure strategy games. As shown in Ferenstein Q, a stop- 
ping game with mixed or randomised strategies (in this case, randomised stopping times) can be 
reformulated as a stopping game with pure strategies with an appropriate filtration enlargement. 
Furthermore, in the context of evaluating game options, it is not practical to implement mixed or 
randomised strategies during payoff replication. 



3 Single Period Deterministic Games 

Throughout this paper, the game option terminology of "exercise" will be utilised when referring to 
the stopping or quitting of the game by the players. The corresponding payoff from doing so will be 
called "exercise payoffs" . 

Before proposing a multi-player variant, it is useful to recall the mechanism of a two person 



game option, as defined in Kifer 11[ . Essentially, the game option is a contract where the buyer can 
exercise the option at any time t for a payoff X t , while the seller can cancel (or also "exercise") the 
option at any time t for a cancellation fee of Y t . If no one does anything, the contract will expire at 
time T with the buyer receiving Xt from the seller. 

Remark 3.1. It is common to postulate that the inequality X t < Y t holds for every t. In other 
words, the cancellation fee should always at least as great as the exercise payoff. This circumvents 
the need to deal with simultaneous exercise for the purpose of valuation. When the buyer exercises, 
it will cost the seller at least as much if he also cancels. Similarly when the seller cancels, the buyer 
can only lose by exercising. If the players are exercising optimally, simultaneous exercise only occurs 
when the equality X t — Y t is true, in which case the payoff is still well defined. 

The assumption of X t < Y t can however be removed if we simply add another rule. If exercise and 
cancellation occur simultaneously, then the buyer and the seller simply receive X t and Y t respectively. 
This offers a cleaner mechanism which can be used in non zero-sum games, yet still produces the 
same value if X t < Y t holds. 

When there are more than two players, symmetry needs to be introduced between the buyer and 
the seller. The cancellation fee process can be interpreted as a negative exercise payoff process for 
the seller. So each player k has his own exercise payoff amount X k . And in the event that no player 
exercises, each player should receive a terminal payoff P k . In the two player version, the terminal 
payoffs Pi and P2 correspond to Xt and — Xt- 

Remark 3.2. There are various ways to generalise the exercise mechanism. In the two player 
version, when one person exercises, the entire effect (or "cost") of that action, whether positive of 
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negative, is paid by the other, non-exercising player. This paper will focus on a natural extension of 
that for multiple players, in which the effect of exercise is reflected in the payoffs of the non-exercising 
players, according to some weight function. 

We begin by setting up a single period deterministic game with m players, where exercising is 
only allowed at one predetermined time. 

Definition 3.3. A single period deterministic multi-player game Q, with players indexed by M. = 
{1,2, ... , m}, is specified by the following: 

• The vector X = [X\ , . . . , X m ] , where X k is the amount received by player k if he exercises at 
time 0. 

• The vector P — [Pi, ... , P m ] , where P k is the amount received by player k if no player exercises 
at time 0; 



The rules of the game are: 



1. The strategy s k G S k of player k specifies whether player k exercises, where S k = {0, f } is 
the space of strategies. In particular, Sk = means that player k exercises at time 0, whereas 
s k = I means that player k does not exercise. 

2. Given a strategy profile s 6 5 = IligA'!'-'*' ^ ne exerc ^ se se ^ denoted by £(s), is the set of 
exercising players. 

3. For each strategy profile s, the outcomes of the game Q are represented by the payoff vector 
V(s) — [Vi(s), . . . , V m (s)}, where Vfe(s) is the payoff received by player k if a strategy profile 
s is carried out. It equals 



V k {s) 



where 



X k , k e £(s), 

P k -w k (£(s))D{s), keM\£{s), 



D(s) = £ (X< - P <) 
ies{s) 



is the difference due to exercise and w k (£(s)) is a weight function which will be chosen in 
subsection 13.11 (cf. Definition [ 



The payoffs V k (s) are linear functions of and Pi. Most results of the deterministic case 
can hence be easily extended to stochastic settings, which will be covered in subsection 15.11 The 
difference due to exercise D(s) represents the effect of exercise described in Remark |3.2[ and they 
negatively affect the payoff of the non-exercising players if the weights are positive. As seen later in 
Theorem 13.41 the positivity of weights will be an important condition for a WUC game. 

If Yli&s w i{£) = 1 is satisfied for every £ C M, then the sum of all payoffs is either J2ieM 
(everyone exercises) or J2i£M l eas t one player doesn't exercise). This correspond to the 

property of the two player zero-sum game described in Remark 13.11 In fact, the two player game 
can be recovered by setting u>i({2}) = ^({l}) = 1. 



3.1 Choice of Weights 

In order to proceed further, we need to be more explicit about the way in which the weights are 
specified. In general w k {£) is defined for non-empty subsets £ C M, £ ^ M and k ^ £ . Consider 
the class of games with fixed weights w k (£) but with all possible choices of X and P. The goal of 
this section is to find weights achieving two conditions: 
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1. Every game in the class is WUC. 

2. Every game in the class has at least one equilibrium in pure strategies. 

The motivation is to separate the exercise mechanism, driven by Wk (£) , from the payoff parameters 
X and P. Thus the game retains nice properties irrespective of the choice of X and P. This is 
especially useful in stochastic cases and game options. 

Furthermore, we will assume that Wk{£) ^ for all non-empty subset £ C Ai, £ ^ Ai and k ^ £ . 
This assumption ensures that the decision of any exercising player will always affect the payoff of 
a non-exercising player. It also eliminates various degenerate cases. The WUC condition further 
refines the restrictions on the weights, as shown below. 

Proposition 3.4. Fixing the weights Wk{£) ^ 0, the game Q is WUC for all choices X and P if 
and only if the weights can be written in the following form: 

Wk(£) = , where > and < 1 for all k. (1) 

Proof. See Appendix IA.1I □ 



In (TTJ, the term ak indicates how much player k is affected by the exercise decisions of others. The 
relative size of ak determines the relative size of the weights Wk{£), used to redistributing D(s). By 
Proposition 13.41 the game with weights defined by (TTJ is always WUC, fulfilling the first condition. 
For the second condition of always having a pure strategy Nash equilibrium, further restrictions are 
needed. 

Proposition 3.5. Using weights defined by (TTJ), if^2 ieM ai > 1, then there exist X and P such that 
no equilibrium exists in pure strategies. In particular, X = [0, 0, . . . , 0] and P = [1, — 1, 0, . . . , 0] . 



Proof. See Aooendix lA.il □ 

Combining Propositions 13.41 and 13.51 we arrive at the following definition. 

Definition 3.6 (Weights). For each non-empty subset £ C Ai, £ ^ Ai and k ^ £ , define the weight 
w k (£) by 

wk{£) = - — ^ 

where a%, ... , a m are real numbers satisfying ak > and X^feex fl fc < 1- In particular, < Wk{£) < 1. 



Definition 13.61 will be used as the definition of weights for the remainder of the paper. As shown 
later in Theorem 13.71 the game with these weights always has a value, achieved by all Nash and 
optimal equilibria. 



3.2 Existence of Value 

With the weights determined by Definition 13.61 the game in Definition ^. 3l is now fully specified. The 
main result of this subsection, as well as this paper, is to show that the game always has a value. 
Recall that the uniqueness of the value was already established in Definition 12.21 

Theorem 3.7. The game Q from Definition VJ.cA has at least one optimal equilibrium. Hence it has 
a value. 

Before proving Theorem 13.71 a couple of preliminary results are required. Firstly, Lemma 13.81 
shows that the subgames of Q retain the same structure as Q. This is a consequence of Definition 
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In the game £/, consider a subset £ <Z Ai and assume that every player in £ exercises at time 
0, while the players from A4' = M\£ are still free to make choices. The possible outcomes of the 
game for the players from Ai' define a subgame of G, which we denote by Q' . It is possible to show 
that the Q' behaves similar to Q, but with modified parameters. 

Lemma 3.8 (Subgame). The subgame Q' is equivalent to a game Qm' specified by the following: 

• Set of players Ai' = M\£; 

• X' k = X k , for k £ Ai'; 

. P' k = P k - w k (£) Z ie£ (Xi - Pi), for k £ Ai'; 

• Weights defined by a' k — w k (£) = 1 _y? fc for k £ Ai' ; and 

• Strategy profile s' = [s' k , k £ Ai'] £ Sm 1 ■ 

Proof. See Appendix I A. 2 1 □ 



The property of the subgames proven in Lemma 13.81 is useful because it allows for the use of 
induction on the number of players. Furthermore it allows us to extend any equilibrium results to 
subgame perfect equilibrium results (for example, in section f5. 31) . 

We also require Lemma 13.91 which asserts that the difference due to exercise has to be positive 
under any Nash equilibrium. 

Lemma 3.9. If s* is an equilibrium then D(s*) = Y^ies(s*)(Xi ~ Pi) > 0. 

Proof. Intuitively, if Ylie£(s*)(Xi — Pi) < 0, then there exists a player k £ £s* who can do better 
by not exercising. For details see Appendix I A. 2 1 □ 

Now we are ready for the proof of Theorem 13.71 

Theorem \ 3.7\ By Proposition 12.51 in a WUC game, all Nash equilibria are optimal equilibria and 
attain the same value. So it is sufficient to construct a Nash equilibrium in pure strategies. 

We proceed by induction on the number of players. Note that the game is still well-defined as 
a single person game when m = 1, and all single person games have at least one equilibrium. In 
particular, s* = [1] if Pi > X 1 or s* = [0] if Pi < X x . 

Consider m > 2. If Pj > Xi for all i, then s* — 1 is an equilibrium. If P k < X k for some 
k, consider the m — 1 player subgame G{- k }- Let s' be an equilibrium of G{- k }i which exists by 
the induction hypothesis. Consider the strategy profile s* — [l,s_/. = s'], we will show it is an 
equilibrium of Q. 

By construction, s' is an equilibrium of j so any player i ^ k cannot improve his payoff by 

changing strategy. Hence it is sufficient to check that player k cannot improve by not exercising, or 
V k {s*) = X k > Vfc(s) = P fc - w k {£)D(s) where s = [0, s_ fc = s'] and £{s') = £{s) = £. Now write 
D(s) in terms of the subgame G{~ k } variables P[ and D'(s') (as defined in Lemma T3.8P 

D(s) = - Pi) + (P! - P) = D'(s') - £ Wl ({k})(X k P k ). (2) 

Substituting ([2]) back, we want the following expression to be non-negative, 
X k - (P k - w k (£)D(s)) 
= X k -P k + w k (£)D'(s') - w k (£)(X k - P k ) J2 M{k}) 

= w k (£)D'(s') + (X k - P fe ) (l - w k (£) ^ M{k})) ■ (3) 
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Applying Lemma [223 to the subgame G{k}, D'(s') > 0. Also recall Wk{£) > 0, (X^ — Pfc) > by 
assumptions. It remains to check the last term of ([3]), which can be written as 



a k Hies a * 1 - a fe _ ^2ies 



1 - w fc (£) > ^({fc}) = 1 - = = T — fft 



(4) 



Finally, Q is non- negative because dfc + J2ies a i — J2i£M a * — 1- So we indeed have Xk > Vk(s). 
Therefore s* is an equilibrium and the induction is complete. □ 

An algorithm of finding an equilibrium s* follows from the proof of Theorem 13.71 by repeatedly 
identifying players with Pj < X, as exercising players, and reducing the problem to a smaller case. 
The equilibrium and its value are found within m — 1 iterations. 



4 Construction of Value 



In section [31 a multi-player game Q is introduced to model a multilateral contract, where the payoffs 
have an implicit competitive structure. Theorem 13.71 established that Q has a value, using an 
inductive argument. In this section, Theorem 14.91 will give an explicit construction of the value. The 
construction is motivated by Guo and Rutkowski 0], which discussed a special case of Q which is 
zero-sum, and expressed the value in terms of a projection. Subsection 14.11 will briefly revisit these 
results, then subsection 14.21 will extend them to the general case. 

We begin by identifying the payoffs as as vectors in R m and introducing the following notations. 
Definition 4.1 (Hyperplane and Orthant). For any subset £ C Ai, define the hyperplane Hf as 

U £ = jx e R m : x H = Xi, Vi G £ j. 

In particular, H0 = R m and H» = X. The hyperplane contains all the possible payoffs if all 
players in £ exercise. Also define the orthant O as 

O = jcc e R m : x t > Xi, 1 < i < ml. 

The boundary of O are the hyperplanes M^y 

Proposition 4.2. The value V* of the game Q must lie inside the orthant O. 



Proof. Given any equilibrium s* , the payoff of each player should be at least as great as his exercise 
payoff, or V k (s*) > X k for all k e M. Hence V* G O. □ 

Remark 4.3. Given a normed vector space R m , for any vector P and any closed convex set K, denote 
by 7Tk (P) the projection of P onto K. In other words, 7Tk (P) is a vector satisfying 7Tk (P) G K and 

||7r K (P)-P|| < \\Q-P\\, VQ G K. 

For the normed spaces discussed in the following subsections, the existence and uniqueness of the 
projection are well-known. 



4.1 Zero and Constant Sum Cases 

This subsection will only present the main definitions and results without proof. A detailed treatment 
can be found in Guo and Rutkowski \i» . Note that analogous results for the non zero-sum cases will 
be presented in subsection 14.21 (cf. Proposition 14.81 Theorem 14.91 and Remark |4. 10[) . 
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Only in this subsection, add the following constraints to Definition ^. 61 the definition of weights: 
E°* = 1 => E^( g )= i_v £ai =1 V£cM,£^0. (5) 
Denote the resulting game by Q°. If £ (s) = M., then ^2 ieM Vj(s) = J2ieM -^*- Otherwise, 

E = E * + E ( p * - ) E^ - ^: 
=E^+E^-E(^- p i) = E p - 

i££ j'Gf ie^Vf 

So the payoffs of Q° have a constant sum, except when everyone exercises. All possible payoffs except 
one lie on the hyperplane 



x G 



E^ = E p 



In particular, 5° is almost zero sum if Eie.M P * = ^- Next, for any proper subset £ C M, define 
the hyperplane 



l° £ = M £ n H° 



{m m x 

cc G K m : = Vt G £ and E Xi = E p f' 
i=l i=l > 



In particular H§ = H°. This definition is not adequate when £ = M, because I» n H° = X n H° 
is empty unless J2 ieM Xi = J2ieM p - But ^ or completeness we further define = Mm = X. 

Definition 4.4 (Inner Product and Norm). Endow R m with the following inner product and norm 

m / \ / m 2 \ ^ 

= £(^), imi°-(e|) ■ 

i—l x 7 / 

The choice of norm in Definition ^. 4l will be justified by Proposition ^. 51 which gives an alternative 
yet elegant way of representing and computing the payoff vector, provided that the exercise set is 
known. 

Proposition 4.5 (Payoff as Projection). For any strategy profile s G S, the payoff vector V(s) 
equals 

V(s) = M(s) (P) 
where the projection it is taken under the norm \\-\\° ■ 
Consider the simplex § given by the formula 

x G M m : Xi > X u 1 < i < m and E x * = E Pl f ' 

i=l i—l ' 

There are a few possibilities: 

• If E. iG M Xi > J2ieM p > s is empty-; 

• If E ie M = J2ieM Pi, § = M° M = X is & single point; 

• If J2ieM < J2ieM fl, § is an in - 1 dimensional simplex, whose faces are § n as A 
varies over the subsets of A4. 
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As mentioned in Proposition 14.21 the value V* must lie inside O. Since the all payoffs of Q° but one 
lie on H°, we expect V* to lie in S (if it is non-empty). The following theorem shows that V* is in 
fact the projection of P onto S. 

Theorem 4.6 (Value by Projection). In game Q , ifJ2ieM X i — J2ieM p > ^ e va ^ ue ^* * s 

V* = V(s*) = ir s (P) 

where the projection it is taken under the norm ||-|| from Definition \4-4\ An optimal equilibrium 
s* = [sj, . . . , s* n ] is given by 

4-0 <=► [nsiP^^Xi. 
V^2ieM > ^2i£M the value is V* = V(s*) = X with an optimal equilibrium given by s* = 0. 

The two cases in Theorem 14.61 resemble the phenomena described in Remark 13.11 for two player 
game options. In the first case, J2ieM < Y^ieM^ lB analogous to the condition X t < Y t . On 
the other hand, if J2 ieM Xi > J2 ieM Pi, the game becomes degenerate and every player exercises. 
The effect is similar to the case of X t > Y t in two player game options. 



4.2 Non Zero-Sum Cases 

This subsection extends the projection representation to cases where Q is not constant (or zero) 
sum, or YlieM a i < 1- The key idea is to construct an equivalent constant (or zero) sum game by 
introducing a dummy player. 

Given an m player game Q with ^2 ieM di < 1, construct anm + 1 player game Q by adding a 
dummy player with the following attributes: 

• No exercising allowed; 

• P m +i is arbitrary, but for the sake of simplicity set P m +i = — Siex ^> so ^° i s zero-sum; 

• Weights w m+ i(£) = CH where a m+1 = 1 - J2ieM a * > °! 

• Payoff function V m+ i(s) = P m+1 - w m+ i(£(s)) Y,ie£( s )( X i ~ P i)- 

The space of strategy profiles in Q° is the same as S from Q, because player m + 1 cannot make any 
choices. The payoffs of players in A4 do not change in Q°. Any Nash/optimal equilibrium for Q° is 
also a Nash/optimal equilibrium of Q and vice versa. Finally the value of Q equals the value of Q°, 
after restricting to the first m coordinates. 

Since Q° is zero-sum, the notations from subsection 14.11 can also be used here. Furthermore Q° 
has the advantage of the dummy player not being able to exercise, so the game is always zero-sum. 
In other words, V m +i(s) + J2 ieM Vi{s) — for all s e S. Hence the space of payoff vectors for Q° 
is the hyperplane H° = {x : £j = 0} C R m+1 , endowed with the inner product (•, -)° from 

Definition 14.41 

(x, y)° = y( ^) + ( ~ Xl) ( ~ Vi) . (6) 

~[ \ a i J a rn+l 

Rewriting a m+ i in terms of ai, . . . , a m , © motivates the following inner product and norm for W 71 , 
the space of payoffs for Q. 

Definition 4.7 (Inner Product and Norm). Endow W 1 with the following inner product and norm 

i=i v a 
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Consider the isometry (j> : H° — > R m which simply discards the (to + l)-th coordinate. It maps 
Hjl = H4 n H° and S = O n H° to H4 and O respectively. Equipped with the new inner product 
(•, •), we can discard the extra player, and return to R m and Q. Proposition 14.51 and Theorem 14.61 
can now be analogously formulated for the general case, as Proposition 14.81 and Theorem 14.91 The 
proofs are in Appendix IA.3I 

Proposition 4.8 (Payoff as Projection). For any strategy profile s G <S, the payoff vector V(s) 
equals 

V(s) = n Me(s) (P) 
where the projection tt is taken under the norm \\-\\. 

Theorem 4.9 (Unique Value by Projection). In the game Q , the value is given by 

V* = V(s*) = [VKO, • • - > V m (s*)} = tto (P) 

where the projection tt is taken under the norm \\-\\. An optimal equilibrium s* = [s*,...,s£j is 
given by 

4-0 [ir®(P)]. = Xi. 

The existence of the value was first established in Theorem l3.7l Theorem l4.9l reaffirms that result 
by providing an explicit representation using a concise notation. The chosen optimal equilibrium s* 
is determined by identifying the exercise set £(s*) from the hyperplanes used in projection ttq, (P) — 

TTn^.j (P). 

Remark 4.10. If the original game Q is already constant (or zero) sum, then the introduction of 
player to + 1 is problematic. Division by zero occurs because a m+ i = 1 — ^2 ieM ai = ^> so » ' (") 
and ||-||° from subsection 14 . 1 1 are no longer well defined. However, Proposition 14. 81 and Theorem 14.91 
can still be applied to the constant (or zero) sum case if the following conventions are adopted. 

Consider the m + 1 player game Q e , where a%, . . . , a m and a m +i are replaced by 

a\ = Oi - — , i e M; < n+1 = 1 - V a\ = e. 

ieM 

Take Q° to be the limit of Q e as e — >• 0. Since for any fixed k £ M. and any fixed s £ 5, the C/ £ payoff 
Vjf(s) is continuous (linear, in fact) in e, the limit VJ?(s) is indeed the desired Q° payoff. Furthermore, 
Vf k — max Sfc min s _ fc V fe € ([sfe,s_fc]) and V\ = mm s _ k max Sf . V£ ([sfc,s_fc]) are also continuous in e, 
so the value of Q e converges to the value of Q a as well. 

Once again, the payoffs and value of the m player game Q are obtained from Q° after discarding 
the (to + l)-th coordinate. Although ||-|| does not exist, Proposition 14.81 and Theorem 14.91 can still 
be recovered by redefining the projection tt. We formally adopt the following convention for the 
remainder of the paper. 

Definition 4.11 (Projection). When YlieM a ' 1 < ^' then define tt as the projection (see Remark 
14. 3[) under ||-|| from Definition 14.71 In the case of YlieM ai = ^' define tt for M.g,£ C Ai and O as 
follows, 

tt M£ (P) = lim 7r^ £ (P), vro (P) = lim TT e Q (P) 
where tt £ is the projection under the norm 

"•-(gfe^x)* 23 ^) 4 - n 

Intuitively the x i) 2 / e term in ([7]) serves as a penalty function, to keep the deviation 

of J>2i=i x i ^° a minimum during projection. In subsection 14.11 the same effect is produced by 
restricting projections to the zero-sum hyperplane H° (hence using and § instead of and 
O). The penalty function allows for a cleaner representation consistent with the general case. The 
notation of ttq (P) in Theorem 14.91 also conveniently eliminates the need for cases in Theorem 14.61 



12 



Unilaterally Competitive Multi-Player Stopping Games 



5 Stochastic and Multiple Period Extensions 

The goal of this section is to study various extensions of the single period deterministic game. We 
begin by presenting a straight forward single period stochastic version. For multiple periods, instead 
of only choosing between "exercise" and "not exercise" , each player also have to decide "when to 
exercise". These games are rarely WUC (cf. Definition ^. 41) . even if the single period building blocks 
are. Nevertheless, we attempt to identify generalisations where optimal equilibria and value still 
exist. 

5.1 Single Period Stochastic Games 

The stochastic game is very similar to the deterministic version from Definition 13.31 except the 
vectors X and P are not known at the time of exercise. All of the following definitions are taken 
under the probability space (f2, T , P). 

Definition 5.1 (Stochastic Game). A single period stochastic multi-player game Q, with players 
indexed by M. = {1,2,..., m}, is specified by the following: 

• The J 7 - measurable random vectors P = [Pi, ... , P m ], X = \X\, . . . , X m ]; 

• The weights Wk{£) = 1 _^' fc — — for k f £ C M, where ai > is deterministic and J2h=m a « — 
1. 

The rules of the game are: 

1. The players are only allowed to exercise at time 0, when P and X may not be known exactly. 
The payoffs are distributed at time 1. 

2. The space of strategy profiles is S — YiieM where Sk — {0, 1} is the space of pure strategies 
for player k with Sk — meaning player k exercises at time 0. The exercise set £ (s), s € S is 
the set of exercising players. 

3. For each strategy profile s, the outcome of the game is the expected payoff vector V(s) — 
[Vi(s), . . .,V m (s)], defined by 

14(s) = Ep(X fe l{ feG£ ( s )} + A fe l{ fc ^ £ ( s )}) 

where 

Xk = Pk-w k {£{s)) (Xk-Pk)- 

k££(s) 

By the linearity of the payoff function Vk (s), the stochastic game is equivalent to a deterministic 
game Q' starting at time with P' = E P (P) and X' = E r (X). Hence by Theorems |3~71 and K§[ 
the value of Q is given by: 

Proposition 5.2 (Value by Projection). The stochastic game Q has a unique value given by 

V(s*) = 7T (E P (P)) 

where O is the orthant defined by 

O = ja: e R m : x, > E P (X t ) , 1 < i < m\ 

and the projection ir is given by Definition ^ ■ 11\ A possible optimal equilibrium s* — [si, ... , sjjj is 
given by 

s-=0 <=> [ 7 r (E P (P))].=X i . 



I. Guo 



13 



5.2 Multiple Period Stopping Games 

There are various ways of generalising two player game option to multiple players while keeping 
the underlying dynamics. This paper will consider a game similar in dynamic to the one studied in 
Solan and Vieille [13], which stops as soon as any subset of players chooses to stop. We will use a 
payoff function generalising the single period game from Definition l3.3l with the goal of constructing 
optimal equilibria in pure strategies. The game will be recursive, in the sense that each player 
chooses between an exercise payoff and the value of a shorter game from the next period onward, 
ft is reminiscent of compound or nested financial options, and it reduces to the discrete time two 



player game contingent claim (cf. Kifer ll[) if m = 2 and a\ = 0,2 = 1/2. 

Using standard terminologies for stochastic processes, all of the following definitions are taken 
under the probability space (f2, T, P) endowed with the filtration F = {Ft : t = 0,1, ... ,T}, repre- 
senting the possible exercise times. 

Definition 5.3. For t = 0, 1, . . . , T, a multi-player stochastic stopping game Qt, with players indexed 
by Ai = {1, 2, . . . , m}, is defined on the time interval [t, T], specified by the following inputs: 

• The F-adapted processes X u — [Xi^ u , . . . , X mtU ], where u = t, . . . ,T; 

• The weights Wk{£) — 1 _^ tfc — — for k £ C M, where ai > is deterministic and J2 i£M ai < 
1. 



The rules of the game are: 

1. Each player can exercise at any time in the interval [t,T\. The game stops as soon as anyone 
exercises. If no one exercises before time T, then everyone must exercise at time T . 

2. The strategy Sk,t of player k is a stopping time chosen from the space S k ,t of F stopping times, 
in the interval [t, T]. The strategy profile St — [si,t? ■ • • , Sm,t ] £ S t is the m-tuple of stopping 
times. Denote s't — si.t A • • • A s TO ,t to be the minimal stopping time, also an F stopping time. 
The exercise set E (si) = {i € M. : s^j = s*} is the random set of earliest stopping players. 

3. For each strategy profile St, the outcome of the game is the expected payoff vector Vt(st) = 
[V M (s t ), . . . , V m ,t(st)], defined by 

Vk,t{s t ) = Ep(Xk,s t '&{ke£(st)} + Xk,s t l{fc££(s t )} | Ft) (8) 

where 

Xk,% = Vl %+1 - w k (S(s t )) Y, ( x i,s t ~V* tt+1 ), s t <T. (9) 

and V s* + 1 = [ V l,s t + V •••■> V m,s t +l) is the Value ° f the § ame 

As the game is stopped at time s t , the indicator functions in (jSJ separate the exercising players 
from the others. Xk,t t is the payoff for an exercising player while Xk : s t is the payoff for a non- 
exercising player. The game Gs t +i can be considered as the continuation of the current game 
if it does not stop at time si. Note that in (O, Xk,s t is not defined for 'st — T, but this does 
not matter because if the game is only stopped at T, every player must exercise and receive 
X kiT , not X k ,T. 

Definition 15.31 is recursive. Since % + 1 > t, the payoff of Qt may depend on the values of 
Qt+i, ■ ■ ■ ,Gt (which are themselves subgames of Qt). Intuitively, it is perhaps easier to view the 
stopping game as a sequence of single period games. If Qt is stopped at t, then the exercising players 
receive X kt t while the other players receive 

MKt+i I ?*) - M£) E ( x m - Ep(^+i \F)). 
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If Qt is not stopped at time t, then it continues on to become the game Gt+i- And finally the game 
Qt always stops at time T as everyone exercises. 

Proposition 5.4. The expected payoff of Q t , ^t( s t) = P4,t(st), • • • , Vm,t( s t)L can be represented 
using the projection as follows: 

V t (s t ) = E P (7T H£(st) (Vs t+1 ) t{s t <T} + X T t{s t =T} | F t ), 
where M.£t Si \ is the J-^ t -measurable hyperplane 



h(s t ) = |x G M m : Xi = Xift, Vz G £(«*) 



Proof. This follows immediately from Definition 15.31 and Proposition 14.81 □ 
Theorem 5.5. Recursively define the JF t -measurable vector Ut = \Ui.t, ■ ■ ■ , U m .t] by 

U t = 7r 0t (Ep(l/ t+ i | F t )) , forO<t<T, U T = X T (10) 
with Ot being the Ft -measurable orthant 

x G M m : x, > X u , 1 < i < mi, (11) 



and t/ie projection tt is given by Definition ^. 11\ Define the set of ¥ -stopping times t to be 

t = [n,...,T m ], where = inf {u G [t, T] : U iiU = X^ u } . (12) 

Then 

1. Ut = Vt(r) taking t as a strategy profile of Q t . and 

2. t is an optimal equilibrium of the multi-player stochastic stopping game Qt, hence Ut = V t * is 
the value of Qt ■ 

Proof. The statements are proven simultaneously by backward induction. In the case of t = T, 
t = [T, . . . , T] . The game Qt is always stopped at time T with the payoff vector Xt = Ut = Vr (t) 
also being the value. Now assume the statements are true for the game Qt+i, so its value is given by 

V t * +1 = U t+1 = V t+1 (r). (13) 

Note throughout the proof that if the game Qt is stopped at time t, then it is reduced to a single 
period stochastic game (cf. Definition 15. ip with payoff vectors X t and V t * +1 . Denote this single 
period game by Q' . Also write r = t\ A • • • A r m . 

(Statement 1) Case 1: If f = t, the game is stopped at time t. By Proposition 15.21 and (|T2"j). r is an 
optimal equilibrium of the single period game Q' , whose value is 

U t = tto, (E r (U t+l \F t ))= TT Qt (E P (V; +1 \ F t )) = tt H£(t) (E P (V t * +1 \ F t )) . 

The result then follows from Proposition [531 noting that Hg( T ) is Ft -measurable, 

U t = 7t H£(t) (Ep(V t * +1 \F t ))= E P (^ £(T) | Ft) = V t (r). 

Case 2: If r > t + 1, the game is not stopped at time t, then by (flQ)) . (TrT|) and (TT21) . Uij > Xi t 
and Ut lies in the interior of Ot . Applying the induction hypothesis (fT"3")) , 

U t = 7r 0t (E P (U t+1 \F t ))= E P (Ut+i | Ft) = E P (Vt+i(r) | F t ). 
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It is sufficient to show V t {r) = Ep(V*+i(t) | Ft), which is true by noting f > t + 1 and applying 
Proposition 15.41 

V t (r) = E P (E P (7r H£(T) (V 9 * +1 ) \ F t+ i) \ Ft) = E P (V t+1 (r) | F t ) if f < T, 
Vt(r) = E P (E P (X T | F t+ i) | Ft) = E P (V i+1 (r) | F t ) if r = T. 

(Statement 2) By Statement 1, (ITU)) and the induction hypothesis (fT"3"l) . 

VKr) = E7* = 7r 0t (E P (Ut+i | Ji)) = 7r 0t (E P (V r t ; i | Ji)) ■ 

To check r is an optimal equilibrium (cf. Definition I2.1[) . we require for each k € M, 

V klt ([r fc ,s- fe ]) > [tto, (Ep(V f * +1 | ^ t ))] fc > V3b,* ([«fc,r_*]) 

for all Sfc £ iSfc, s_fe E 5— Let s' = [r^, S-k] , s" — [sfc,T_jJ be alternative strategy profiles with 
minimal stopping times s',s y '. 

Case 1: If s' = s" = t, then both s' and s" can be interpreted as strategy profiles of the single 
period game Q' . Hence the result follows from Proposition 15.21 because [no t (Ep(V t * +1 \^t))] k is 
value of Q' for player k. 

Case 2: If s 1 > t+1, then s' is a valid strategy profile of Gt+i- Also we must have t% > t + 1 being 
a maximin strategy, because it belongs to an optimal equilibrium of Qt+i by (|12p and the induction 
hypothesis. From Proposition 15 .41 

y t (s') = E P (E P (7r H£(a;) (V£ +1 ) | J- t+1 ) | Ji) = E,(V t+1 (s / ) | Ji) if ? < T, 
Vt(s') = E P (E P (X T | F t+ i) | Ft) = E F (V m (s') | F t ) iis 1 = T. 

Using the fact that r k is a maximin strategy, 

V k , t (s') = E v (V w (s') | Ft) = E P (V M ([r fc , s_ fe ]) | 7i) > E P (V fc * t+1 | 7*). (14) 

Since Tfc > t + 1 , by (fl"2"|) we must have 

K (E P (V t * +1 |^t))] fc = K (E P (l/ t+1 |J- t ))] fc > X k . t . (15) 

By Proposition 15.21 and (fTS"|) , player A: does not exercise in the optimal equilibrium of the single 
period game Q' . Interpreting E P (V fe * t+1 | Ft) as the expected payoff of player k if no one exercises, 
and using the definition of optimal equilibrium, 

MKt+i I ^0 > K (E P (V7 +1 \F t ))] k . (16) 

CI and CFJ imply V k , t (s') > [vr 0t (E r (V t * +1 \ F t ))] k as required. 

Case 3: If 5" > t + 1, by arguments similar to the ones used for (|T4"]) 

U M (s") = E P (Vfc, t+1 ( S ") | Ft) < M V k,t+i I ( 17 ) 

For all i ^ fc, since n > t + 1, by flJJ) we have [ttq, (E P (y^ 1 | F t ))] i = [tto, (E P (t7 t+ i | F t ))] i > 
Xj, it . By Proposition [521 H i k, player i does not exercise in the optimal equilibrium of the single 
period game Q' . Again interpreting Ep(V k * t+1 | Ft) as the expected payoff of player k if no one 
exercises, and using the definition of optimal equilibrium, 

MKt+i I < K OW+i I ft)) ] k . (is) 

(HZJ) and (0 imply V M (s") < [ttq, (E P (F t ; l | J" t )) ] fc as required. 

Both statements are hence proven and the induction is complete. □ 
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It is possible to further generalise the game by making the weights (hence ai) time dependent 
and F-adapted. The weights at time s will be applied as the game is stopped. Theorem 15.51 will 
analogously hold, with the projection 7r also made time dependent. But for brevity that case will 
not be included here. 

Remark 5.6. The stopping game described by Definition 15.31 is perhaps not the most obvious 
generalisation of the single period game. A more natural generalisation would be for the non- 
exercising player k to receive 



as the game is stopped. That is, using X^,t instead of the value V k *~ +1 of Qt+i- But even in 
deterministic cases, this does not always produce optimal equilibria in pure strategies. For example, 
consider a game with X = [— 1,-1,0], Xi = [— 2,— 2,4], X 2 = [0,0,0] and a\ = a 2 = a 3 = 1/3. 
Player 3 will always want to exercise at time 1, while there is a prisoner's dilemma between players 1 
and 2 at time 0. This game has two Nash equilibria (with different payoffs) but no optimal equilibria 
in pure strategies. 

5.3 Quitting Games 

A quitting game is an alternative formulation of the multiple period case. As opposed to a stopping 
game, a quitting game does not end when one player stops or exercises. Instead, the non-exercising 
players continue the game and may exercise at a later date. We focus on the deterministic case here, 
as the stochastic case does not always produce optimal equilibria (see Remark I5.10[) . 

Definition 5.7. A deterministic multi-player stochastic quitting game Q, with players indexed by 
M = {1,2, . . . , m}, is specified by the following inputs: 

• The vectors X t = [X 1;t , . . . , X mit ], t = 0,...,T; 

• The weights Wk(£) = i_y^ fc — — , where a* > and ^2 i€M a% < 1. 

The rules of the game are as follows. If player k exercises at time t in the interval [0,T — 1] (or 
Sk = t), he receives a payoff of Xk,t- If player k does not exercise before time T (or Sk — T), he 
receives X^t — Wk{£(s)) Yliee(s)(Xi,t ~ ^Q,r) at time T, where £ (s) is the set of players exercising 
before time T and s — [s\, . . . , s m ]. In other words, the payoff vector V(s) = [Vi(s), . . . , V m (s)] is 
given by 



X k . T - w k (£(s t )) ( X i,t t - X k , T ), s t < T, 




Theorem 5.8. The quitting game Q has a unique value given by 



V* = ir® (X t ) 



where O is the orthant defined by 



O = s 6 I m : > max X itt ,l<i<m 



and the projection n is given by Definition \4- 11\ A possible optimal equilibrium s* 
given by 



— [ S l> • • • 7 S J 



ni 



IS 



s* = mm{t e [0,T] : ir® {X t ) l < A M } . 
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Proof. Consider a single period game Q' with X' k = maxo<t<T-i X k ,t and P k = X k ,T- By Theorem 
14.91 V* is the value of Q 1 . If player k exercises then ttq (X t ) = X' k = maxo<t<T-i X ky t- If player 
k does not exercise then V k * — X kiT — w k (£(s*))D(s*) < X k .r (recall D(s*) > from Lemma 131?)) . 
Either way s k is well-defined, and the exercise decision corresponds to the optimal equilibrium of 
Q' . Hence it is easy to check that V* = V(s*). 

Now to see s* is an optimal equilibrium of the quitting game Q, as per Definition 12.11 first we 
check that it is a Nash equilibrium, or 

Vi ([sl,st k ]) > Vk ([«*,***]) , V s k G S_ fc . 

If si < T and player k exercises. He cannot improve his payoff by exercising at another time 
since X ktS * k is maximal, neither can he improve by not exercising since s* corresponds to a Nash 
equilibrium in Q' . 

If instead s* k — T and player k does not exercise, it is sufficient to check that he cannot improve 
by exercising at any time, or V* > maxo<t<T-i Xk,t — X' k . This is certainly true since, once again, 
s* corresponds to a Nash equilibrium in Q' . Therefore s* is a Nash equilibrium of Q. 

To complete the proof, we are left to check 

V k ([4,s_ fe ]) > V k ([s* k ,s*_ k ]) , Vs_ fc G S- k - 

If s k < T and player k exercises, his payoff is fixed and cannot be decreased by the action of other 
players. If instead s k =T and player k does not exercise, write s = [sj£, s_fc] , then 

V k (s) = X h .T - Wk(£(s)) J2 ( X i,*i - X ^t) 

ies{s) 

> Xk.r - w k {£(s)) V* I max X i t - X i T I 

jL^ \o<t<T-l / 

ies(s) v 7 
= P' k - w k (£(s)) J2 ( X 'i ~ P D = V k( s ') (19) 

where s' is the strategy profile in which any exercising player under s chooses to exercise for maximal 
payoff X[ instead. But s' corresponds to a strategy profile in Q' with player k not exercising. Since 
s* is an optimal equilibrium of Q' , we have V k (s') > V k (s*). Combining with (IT91 . it implies 
Vfe(s) > Vfe(s*), as required. □ 

One thing to note about Theorem l5.8l is that it doesn't specify the amount of information available 
to the players regarding the exercise decisions of others. Unlike a stopping game, the strategies in 
a quitting game can also depend on the observable actions of other players. However, if we denote 
the total information available to player k over time by the filtration = {Fk,t)t = 0, . . . , T}, then 
the strategy s k is an stopping time. Theorem l5.8l shows that, in the quitting game, regardless of 
how much or little any player observes about the actions of others, the value of the game is fixed. 
An optimal equilibrium attaining the value can be chosen independently. 

Remark 5.9. If Q is a perfect information quitting game, that is, s, is an Wj stopping time for any 
i 7^ j, then a subgame perfect optimal equilibrium can be constructed. Denote the set of exercising 
player up to time t — 1 by £t-i(s) and the set of remaining players by M.t = M\£t-i{s). By Lemma 
13.81 the quitting game Gt on the interval [t, T] is a subgame amongst the remaining players A4t, but 
with following variable modifications: 

. X' k u = X kMl for t < u < T - 1, k g M t ; 

• X' k T = X k:T - Wk(£t-i(s))T,ie£ t _ 1 ( s )( x i,s i ~Xi,r), for k G M t ; 

• Weights defined by a' k = w k {£ t -i{s)) = 1 _y, ak — , for k G Mt; and 
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• Strategy profile s t = [sk,t, k G Mt] G S t - 

A subgame perfect optimal equilibrium s** is an optimal equilibrium for any reachable subgame 
of Q. In particular it is also an optimal equilibrium and attains the value V(s**) = V* . The optimal 
equilibrium s* constructed in Theorem 15.81 is not necessarily subgame perfect, as the strategy s£ 
of player k does not take the actions of others into account. For example, assume another player 
i deviates from s* by exercising too early. The strategy s£ does not adjust to punish the mistake. 
Even though the value of V£ is still guaranteed, player k misses the chance to guarantee an even 
higher payoff, created by the sub-optimal deviation of player i. 

The subgame perfect optimal equilibrium can be constructed recursively, by backward induction 
with respect to both time and remaining set of players. Specifically, for the quitting game subgame 
Qti the subgame perfect optimal equilibrium sjf* = [sjjl* t , k G Mt] is given by s" T = T and 

s k*t = t '^{*o t (X kl T) k =X kl t} + s k*t+l^{no t (X h , T ) k >X k , t }> 1 < T 

where Of = {xi > maxt< u <r_i Xi tU , i G Mt} is an R^*! orthant, the projection n is defined by 
Definition 14. 1 1 1 with modifications for Mt, and s£ t+1 is the player k subgame perfect optimal equi- 
librium strategy of the subgame Qt+i- Finally s** — Sq* is t ne subgame perfect optimal equilibrium 
of the perfect information quitting game Q. 

On the other hand, in an imperfect information quitting game, where the players' actions are 
partially or completely hidden to others, it is not always possible to determine the current subgame 
Qt- Therefore subgame perfect optimal equilibria may not exist. But by using the same idea 
as above, each player can construct a strategy which is the optimal equilibrium strategy in all 
observable subgames. And as mentioned, the lack of subgame perfection does not change the value 
of the quitting game. 

Remark 5.10. In the stochastic case, unfortunately equilibria may not exist in pure strategies. For 
example, a game with a\ — a 2 — a 3 = 1/3, T\ = Ti = {0, {oji}, {^2}, = P(w 2 ) = 1/2, and 

the payoffs given by X = [2.1,3.5,-50], Xi(wi) = [-50,-50,-5.05], X^) = [4,-50,-50] and 
X 2 {ui) = X 2 (uj2) = [0, 5, -5]. 



A Appendix 

A.l Proof of Propositions 13.41 and 13.51 

Before proving the Proposition 13.41 we require the following lemmas. 

Lemma A.l. Given a subset £ C M with < \£{s)\ < m — 2 and i,j £ £(s), let £' = £ U {j}. If s 
and s' are strategy profiles with £(s) = £ and £(s') = £' (hence player j exercises in s' but not s), 
Then 



(Vi(s) - Vitf)) + Wip'WjM - Vitf)) = {wi(£')[l - Wj {£)] - Wi {£)} D(s). (20) 

Proof. This follows directly from the definition of V, noting that D(s') = D(s) + (Xj — Pj). Start 
with 



V t (s) - Vi(s') = Wi {£')D(s') - Wi(£)D(s) 

= { Wi {£') - Wi(£))D{s) + WiiS'HXj -Pj), 
Wip'XWs) - Vjis')) = wt{e')(P s - Wj {£)D{s) - X,) 

= -w t (£') Wj (£)D( s ) - Wip'XXj - Pj). 

Adding the expressions yields the desired result. □ 
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Lemma A. 2. The game Q is WUC for all choices X and P. if and only if both of the following 
Conditions hold: 

1. For any £ C M, 1 < \£\ < m — 2 and i,j £ £, 

w i {£U{j})(l-w j {£))=w i (£); 

2. For any £ ' C M, 1 < \£'\ < m - 1 and i £' ', 

Wi(£') > 0. 

Proof. We prove the statement in three parts. 

(WUC Condition 1) Take s and s' to be strategy profiles with £ (s) = £ and £ (s') = £' = £ U {j}. 
If the game Q is WUC for all X and P, then 

^( S ) = ^( S / )^^(s) = ^( S ')- 

By Lemma [A. 11 (|20|) becomes 

{tui(£')[l-«>i(£)] =0. 
When |f | > 1, we can choose X, P so that D(s) ^ 0. Hence 

Wi {£')[l - Wj {£)\ =Wi{£) 

and Condition 1 is proven. 

(Condition 1 + WUC =>• Condition 1 + Condition 2) Now assume Condition 1 holds, hence Wi(£')[l— 
Wj(£)} - Wi(£) = for \£\ > 1. Note that if \£\ = 0, then D(s) = 0. In either case, ^ always 
simplifies to 

(V i (s)-V i (s'))=w i (£')(V j (s')-V j (s)). 

But the Q being WUC requires U,(s') - Vj(s) > => U t (s) - U t (s') > 0. Since the weights are 
required to be non-zero, we have Wi{£') > for all 1 < \£' \ < m — 1 and Condition 2 is proven. 

(WUC <^ Condition 1 + Condition 2) As before, Condition 1 quickly gives 

(V i (s)-V i ( S '))=w i (£')(V j (s')-V j (s)). 
When Condition 2 also holds, it's easily checked that the game is indeed WUC. □ 

Proposition \3.4\ It is sufficient to completely solve the system presented in Conditions 1 and 2 of 
Lemma [A. 2 1 Consider the case where m > 4. By Condition 2, we have < Wi({j}) < 1 for all i ^ j. 
By Condition 1, 

,y, Wj{{k}) Wi{{j}) = l~Wfc({j}) 

1-^({J» IJ l-^(W) «;,({*}) 1 -«;,■({*})• 

Since the only the left hand side depend on i, we have, for fc, / distinct, 

w»({j}) = wi({j}) 
Wi({k}) Wl ({k})- 

This system of equations has the parametric solution Wi({j}) — aibj, with ai,bi ^ 0. Substituting 
back, 

'lib, n,b,, 1 1 1 
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where c is a constant for all i. Solving for bj yields Wi({j}) = afij — - °' a . Scale all <2j by a factor 
of 1/c 

Substituting into Condition 1 while recursively incrementing the size of £ , we obtain 



Wk(£) = 



Condition 2 adds the restrictions > 0, X^fc o» < 1< This solution can easily be checked to always 
satisfy both Conditions 1 and 2. 

The cases for m = 2, 3 can be easily solved to obtain the same solutions. □ 
Proposition Iff. 51 First note that for i ^ j 

Wi ( M \{i}) = -^ >1. 

On the other hand, as before, Wi(£) < 1 if |£ < m — 2. 

We will simply check all the possibilities. If {3, ... , m) C £ or if there are only two players, then 
the cases are the following. 

• If both players 1 and 2 exercise, then V% = V% = 0, but player 2 can receive — 1+W2(M\{2}) > 
if he doesn't exercise. 

• If only player 1 exercises, then V\ = 0, but player 1 can receive Pi — 1 if he doesn't exercise. 

• If only player 2 exercises, then V\ = 1 — w\(M. \ {1}) < 0, but player 1 can receive X\ = if 
he also exercises. 

• If neither player 1 or 2 exercises, then Vi = —1, but player 2 can receive X2 = if he also 
exercises. 

If some player k ^ 1, 2 doesn't exercise, so k £ £, then the cases are the following. 

• If both players 1 and 2 exercise, then V\ — V2 — 0, but |£ \ {1}| < m — 2 and player 1 can 
receive 1 — w\{£ \ {1}) > if he doesn't exercise. 

• If only player 1 exercises, then \£\ < m — 2 and V2 = —1 + Wi(£) < 0, but player 2 can receive 
X2 = if he exercises. 

• If only player 2 exercises, then Vk = — Wk(£) < 0, but player k can receive = if he also 
exercises. 

• If neither player 1 or 2 exercises, then V2 = — 1, but player 2 can receive X2 — if he also 
exercises. 

In all cases, the pure strategy profile is not an equilibrium. □ 
A. 2 Proof of Lemmas Effl and [3791 

Proo/ of Lemma \3.8\ Let s' S be any strategy profile of the subgame Qm' and £' = £'(s') be 
the corresponding set of exercising players. Let s 6 5 be a matching strategy profile of the original 
game Q, so 

s = [s k = 4, k e M': s k = 0, k i M'} 
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and £(s) = £ U £' . It is sufficient to check that for any k € the payoff V^(s') of the subgame 
matches the payoff V k (s) of the original game. 

First note that the weights of Gm' can be written as 

fe() "i-E ie ,<"i-^^) (21) 



(23) 



= = — =5* = (22) 

If fc € then V^(s') = X' k = X k = V k (s). Otherwise, 

F fe v)=n E pa- p /) 

= P fe - t» fc (5) ^(Xi - Pi) - w' k (£') E I Xi - Pi + Wi(£) ^(Xj - P 3 
ie£ ieS' y je£ 

= p fe - ( + ^(^) e J2( x * - p J - w '^ E ') - 

Rearranging (f2T|) as w k (£) + ^4(0 Sief w i{£) — w ' k (£'), we can rewrite ((23)) as 
V fc V) = P k - w' k {£') J2(Xi - Pi) - w' k {£') Y,{Xi Pi) 

i£S ie£' 

= P k -w' k (£') ]T (X-Pi) 
ie£{s) 

= P k - w k (£(s)) £ (X, - Pi) by m 

ie£(s) 

= V3fe(a), 

as required. □ 

Lemma [XM Assume the contrary, so D(s*) < 0. Then there must exists a player fc 6 £(s*) 
with X k — P k < 0. Now if player k chooses not to exercise, his payoff will be V k {s') = P k — 
Wk(£(s'))YliE£(s')(Xi — Pi), where s' is the modified strategy profile. Note that s' = [l,s*_ k ] and 
s* = [0, st k ] and thus £(s') = £(s*)\{k}. Since s* is an equilibrium, we have V k (s 1 ) < V k (s*) = X k . 
Therefore, 

X k -V k (s')=X k -P k +w k (£(s')) E (Xi-Pi)>0. (24) 

ies(s') 

Since X fe - P k < 0, 423} implies E tG£ ( s ')( X > _ P i) ^ °- Reca11 in Definition EH < w k (£(s')) < 1. 
Therefore 

£>(»*)= £ (*i - -Pi) fib +«*(£(«')) £ (^- p 0>0, 

ie£(s*) ie£(s') 

contradicting the assumption of D(s*) < 0. □ 
A.3 Proof of Theorem [4791 

The following lemma is a standard result of projection in linear algebra. It is used throughout the 
other proofs. 
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Lemma A. 3. In R m , if IK is a hyperplane, the projection 7r is orthogonal, that is, 7Tk (P) is the 
unique vector in K such that 

(ir K (P)-P,Q-n K (P))=Q, VQel. 

Furthermore, if J is a convex subset of the hyperplane K, then 

nj (P) = ttj (ttk (P)) . 

Proof of Proposition ^. 8\ The vector V(s) 

V(s) = = X,, i g £(s), V5(a) = P ( - (s))P(s), i £ £{sj\ . 

certainly lies in the hyperplane 



l £{s) = jx G E ro : re, = X, for every i G £(s) j. 



So it is sufficient to check that 

v :=P V(s) = [ Vi =Pi- X t , i G £(«), «< = Wi(£(s))D(s), i £ £(s)] 

is orthogonal to M £ ( s y Let u := [u; = 0, i G £(s)] be any vector lying entirely in Hg( s ). Then (u, v) 
evaluates to 

UjWi(£(s))D(s) u i) (Eieg( fl ) ( p » - x i) + E»gg(s) ^(g(g))£>(5) 



z — ' a, 1 — > , a,- 

1 - L i= i a» 



1 - 






UjD(s) 


1 - 


SiG£(s) a i 




t £ (s) UiD(s) 


1 - 





(1-1) = 

as required. □ 

Before proving Theorem 14.91 a couple more lemmas are needed. 
Lemma A. 4. Assume that P G O. T/ien tth^ (P) G O /or any subset A C AL 

Proof. By Theorem 14.81 the projection 7Th a (P) corresponds to the payoff vector when .4 is the set 
of exercising players. Let s be the corresponding strategy profile. In particular, for any i G A 

[n BA (P)].=Xi<Pi. 

and thus D(s) — YlieA^* ~ — ^- Consequently, for any j G A - ! \ A 

and thus tt Ma (P) G O. □ 
Lemma A. 5. Let fc G A4. Ifno(P) ^ H//.} iften Pfc > Xfc. Equivalently, if Pk < Xfc f/ien 

7r (P)eH {fc} . 
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Figure 1: P k = X, 



Proof. Suppose that Pk < X k and assume that 7tq (P) ^ H{ fc }. Then the projection Q = 7rn {fc} (7ro (P)) 
is still in O (by Lemma |A.4[) and it is distinct from ttq (P) (since ttq (P) ^ H{ fe }). We will show 
that 



|P-0|| < ||P-7ro(P) 



(25) 



which contradicts the definition of 7tq (P) . 

In the case of Pk = Xk, as shown in Figure [TJ we have P, Q G Hjj,} and 7tq (P) — Q being 
orthogonal to P — Q. Hence 



|P-Q|| 2 <||P-Q|| 2 



(P)-Q|| 2 = ||P-7To(P)|| 2 . 




R 



Figure 2: P k < X k 



H{fc} 



To establish (l2~5j) in the case P& < Xk, as shown in Figure we introduce a hyperplane M.{ k } 
parallel to H{ fc } by setting 



H {fe} = ix £1™ : z fc =P fe j, 
so that, in particular, P S iui. Let R = ?rg (7ro (P)), so that also 

{ fc} 
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Since Pk < Xf., R & ^{k} and 7tq(P) 6 © \ ^{k} he on opposite sides of the hyperplane M{ k }- It is 
thus clear that 

\\R-Q\\ < ||fl-Q|| + ||Q-7ro(P)|| - ||Jl-7ro(P)||. (26) 
Finally, since P — R is orthogonal to both R — Q and R — ttq (P), we have 

\\P-Q\\ 2 = \\P-R\\ 2 + \\R-Qf 



\P - tto (P)|| 2 = \\P - R\\ 2 + \\R - 7T (P)f . 



and 



Therefore, (|26j) implies (j25j) . as required. □ 

Theorem^JTd^ Begin by noting that in the sub-game Qm' (where M! = Ai\£) defined in Lemma 
13.81 the variables P' k can be rewritten as 

P' k = Pk~ w k {£) - = [n M£ (P) ] k 

according to Proposition ^. 81 The mapping <fi : — > R" l ~l £ l, defined by discarding the coordinates 
with indices in £, is an isometry to the space of Qm' payoffs. Endowed it with the norm 



E 



1 - J2ieM> a i I 



\iEM> V 1 ' ^ieM> ™i 

and let n' be the corresponding projection function. 

Back to the main proof, it is sufficient to show that the strategy profile s* defined by 

s* = ^ [ m (P)].=Xi 

is a Nash equilibrium (hence an optimal equilibrium since Q is WUC). This is done using the same 
induction from the proof of Theorem 13. 71 but with a few additions. The base case of m = 1 can be 
easily checked. 

Consider m > 2. If Pi > Xi for all i, then P lies in the interior of O. So ttq (P) = P and 
s* = 1 is an equilibrium. If Pk < Xk for some fc, consider the m — 1 player subgame Q{_k}- By the 
induction hypothesis, s' G S-k defined by 

s'i=0 <=> KP'j^X, Vze.M\{fc} 

is an equilibrium of G{-k}- Apply the isometry <j> , then Lemma lA.31 

[Ky {!*)].= [7T nH w (7T Hw (i , ))] i = [7r 0nU{k} (P)] v V ie ^\{i}. 

By Lemma [A. 51 Pk < Xk implies ttq (P) S H{ fe }. Hence 7ronH {fc} (P) = (P ) and s' can be 
rewritten as 

[ttoCP)]^^, ViGM\{fc}. 
Finally [Vq (P) ] , = implies s£ = 0, therefore 

s = [s k = 0, s_ fe = s J. 

By the proof of Theorem 13.71 s* must be an equilibrium of Q, as required. □ 
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